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Abstract 

Let us consider a pair signal-observation ((x n ,y n ),n > 0) where the unobserved 
signal (x n ) is a Markov chain and the observed component is such that, given the 
whole sequence (x n ), the random variables (y n ) are independent and the conditional 
distribution of y n only depends on the corresponding state variable x n . The main 
problems raised by these observations are the prediction and filtering of (x n ). We 
introduce sufficient conditions allowing to obtain computable filters using mixtures of 
distributions. The filter system may be finite or infinite dimensional. The method is 
applied to the case where the signal x n = X n ^ is a discrete sampling of a one dimensional 
diffusion process: Concrete models are proved to fit in our conditions. Moreover, for 
these models, exact likelihood inference based on the observation (yo, . . . , y n ) is feasable. 
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1 Introduction 



Let us consider a pair signal-observation ((x n ,y n ),n > 0) where the unobserved signal (x n ) 
is a Markov chain and the observed component is such that, given the whole sequence 
(x n ), the random variables (y n ) are independent and the conditional distribution of y n only 
depends on the corresponding state variable x n . This is a classical setting in the field of non 
linear filtering and the process (y n ) is often called a hidden Markov model. 

In this context, a central problem that has been the subject of a huge number of con- 
tributions is the study of the exact filter, i.e. the sequence of conditional distributions of 
x n given y n , . . . , yi,yo, n > 0. On the other hand, statistical inference based on the (non 
Markovian) observations (yo, . . . ,y n ) requires the knowledge of the successive conditional 
distributions of y n given ... ,yo. These are obtained through the prediction filter, i.e. 
the sequence of conditional distributions of x n given y n _i, . . . , y\, 2/0 • Although the exact and 
the prediction filter may both be calculated recursively by an explicit algorithm, iterations 
become rapidly intractable and exact formulae are difficult to obtain. To overcome this diffi- 
culty, authors generally try to find a parametric family T of distributions on the state space 
X of (x n ) (i.e. a family of distributions specified by a finite fixed number of real parame- 
ters) such that if C(x ) G J 7 , then, for all n, C(x n \y n , . . . ,2/1,2/0) and £(x n |y n _i, . . . ,2/1,2/0) 
both belong to T '■ In this case, the model ((x n ,y n ), J-) is called a finite-dimensional filter 
system and it is enough to describe each conditional distribution by the parameters that 
characterize it. This situation is illustrated by the linear Gaussian Kalman filter (see below 
Section Whenever the initial distribution of the signal is Gaussian, specified by its mean 
and variance, then, all the successive conditional distributions are Gaussian and there is an 
explicit recursive algorithm which gives the stochastic process of the conditional means and 
variances. 

Necessary and sufficient conditions for the existence of finite-dimensional filters in dis- 
crete time have been given in Sawitzki (1981) (see also Runggaldier and Spizzichino (2001)). 
The case of continuous time filters was treated in Chaleyat-Maurel and Michel (1984). As 
a consequence of these papers, it appears that very few finite dimensional filters are avail- 
able and they are often obtained as the result of an ad hoc construction (see e.g., the new 
constructive approach presented in Ferrante and Vidoni (1998) and the references therein). 

In what follows, we propose a method to obtain computable filters that may be finite 
or infinite dimensional. Our approach is a generalization of the one developped in Genon- 
Catalot (2003) and Genon-Catalot and Kessler (2004) for a special model. The method is 
well fitted for the filtering of discretized diffusion processes and illustrated with examples. 

More precisely, (Section EJ) we consider at first a sequence of parametric families of 
distributions J 71 = {v 1 q,0 £ ©}, i £ N, where O C M p is a parameter set. Then, we construct 
an enlarged family by means of mixtures. Let us define the set S of mixture parameters: 



oo 



S = {a = (pi.i, i > 0), Vi > 0, ai > 0, a i = !}• 



(1) 



i=0 



Then, we set: 




> 0) G 5, 6 e 0}. 



(2) 
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Each distribution v = vq a in the above class is thus specified by an usual parameter 9 and 
a mixture parameter a. = (aj, i > 0). We give sufficient conditions on the class T = Ui^J 71 
ensuring that if v = C(xq) belongs to T ", then, the exact and the prediction filter both evolve 
within T . These conditions involve the conditional distribution of y n given x n = x and the 
transition operator of the hidden chain (x n ). Of course, the most interesting case is when 
the mixture distributions obtained for the filters have a finite number of components. To 
this end, we introduce the sub-class Tf of T composed of distributions v = VQ a such that 
on = for i greater than some integer N. We give sufficient conditions ensuring that, when 
C{xq) G Tf, then, the exact and the prediction filter evolve in Tf (Conditions (C1)-(C2) 
and Theorem 12. They are thus specified by a finite number of parameters. This finite 
number may change along the iterations. Nevertheless, the filters are explicitly and exactly 
computable. Let us note that our method has links with the one developped in Di Masi 
et al (1983). In this paper, the distributions of the filters are allowed to be finite linear 
combinations of parametric distributions. The number of terms in the linear combination 
may also change along the iterations. However, the coefficients in each linear combination 
are possibly negative. So these distributions are not mixtures of parametric distributions 
and their interpretation is therefore difficult. 

In Section |2J we illustrate our method on the classical Kalman filter with non Gaussian 
initial distributions. We introduce an appropriate class of non Gaussian distributions and 
give the corresponding explicit formulae for the filters. In Sections I4I5I6I wc consider models 
satisfying our sufficient conditions and for which the signal x n = X n & is a discrete sampling 
of a one-dimensional diffusion process. In Section 21 we study the observation equation 
Vn = x n w n where the signal (x n ) and the noise (w n ) are independent sequences of positive 
random variables. The signal x n is a discrete sampling of the diffusion process given by the 
stochastic differential equation 

dX t = (9X t + {5 ~l )a2 )dt + adflt, (3) 

where (fit) is a standard one-dimensional Brownian motion, 9 is a real parameter, a is 
positive and 5 is a real number such that 5 > 1. This process is called the radial Ornstein- 
Uhlenbeck process. In the case 5 = 1, we consider the absolute value of a one dimensional 
Ornstein-Uhlenbeck process. The noises w n have a specific distribution to ensure explicit 
formulae. In Sections and |HJ we exploit the results of Section0]to develop models based on 
the same signal but different observation equations. In particular, in Section [SJ we propose 
some stochastic volalility type models. Section contains some concluding remarks. Some 
technical proofs are given in the Appendix. 

2 Sufficient conditions for computable filters. 
2.1 The filtering- prediction algorithm. 

We introduce our notations in an abstract framework that will become concrete through the 
examples below. 

We denote by X the state-space of the unobserved Markov chain (x n ), which is equipped 
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with a sigma-field B. We assume that this chain is time-homogeneous and denote by P its 
transition operator. The time-homogeneity assumption is here for the sake of simplicity of 
notations. It is not essential and may be relaxed (see Section [7J). 

The state-space of the observed component (y n ) will be denoted by y and its sigma-field 
by C. We assume that the conditional distribution of y n given x n = x does not depend on 
n and that this distribution is given by a density with respect to a common dominating 
measure /lonj: 

F x (dy) = C(y n \x n = x) = f x (y)fi(dy). (4) 

As stated above, this is also the conditional distribution of y n given the whole sequence 
(xfc) when x n = x. Again, the time-homogeneity assumption here is not essential. On the 
contrary, the existence of a common dominating measure /i for the family of distributions 
(F x (dy),x G X) is essential for the filtering-prediction recursive equations. In the statistical 
vocabulary, it means that this family of distributions is a dominated family when x is 
considered parameter. 

In the concrete models that we investigate in further sections, we shall take X equal to 
R or (0, +00), and y equal to R, (0, +00) or N. 

Let us now briefly recall the filtering-prediction algorithm (see e.g. Del Moral and 
Guionnet (2001)). Given an initial distribution for xq, there is a well known algorithm that 
allows to compute the successive conditional distributions: 

. . updating . . , prediction . . , 

L{x ) ► C(x \y ) * C{xi\y ) 

updating „ , , . prediction - , , > 

> £(xi|yi,y ) * C(x2\yi,yQ) .... 



By the above iterations, we get two kinds of distributions on X: 

Vn\n:0 = £>{Xn\yn, ■ ■ ■ ,Uo), (5) 

"n+i\n:0 = C(x n+1 \y n , . . . ,y ), (6) 
The distribution (jHJ) is called the optimal or exact filter and © is the prediction filter. 

These are obtained using two steps: the updating and the prediction steps which can be 
described by introducing the following operators. Let V{X) denote the set of probability 
measures on X. For v G V(X), the probability ^> y {y) is defined by (see @): 

fyWdx) = , (7) 

(with the convention that 0/0 = 0), where the denominator is equal to 

Mv) = I v{dOU{y). (8) 
JX 

The operator <p y is called the up-dating operator which allows to take into account a new 
observation. 
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On the other hand, the prediction operator is as follows. For v € V(X), the probability 
measure 

f(v) = vP (9) 

is obtained by applying the transition operator of the hidden Markov chain. It is defined 
by: 

i>{y){dx ) = vP(dx) = [ u(dx)P(x,dx'). (10) 

Jx 

We have, for n > 0, (with vo\-i-.o = = £>{xq)) 

v n\n:0 = fyn { v n\n-l:o)i v n+l\n:0 

The optimal filter is obtained via the operator 
and 

^n+l|n+l:0 = ®y n +l( u n\n:o) = $y n +i ° ■ 

The prediction filter is obtained via the operator 

$y = l/> ° <Py, (14) 

and 

^n+l|n:0 = % n {^n\n-l:o) = ° ■ ■ ■ ^("o). (15) 

Moreover, the conditional density of y n given (y n -i, ■ ■ ■ , yo) is obtained as the following 
marginal density (see (jHJ)): 

p(y\y n -i,...,yo) = p» nln _ 1:0 (y)- (16) 

And the exact density of (yo, Ui, ■ ■ ■ > Vn) is obtained as the product of the successive condi- 
tional densities p(y;|yj_i, . . . , y ). 

2.2 Sufficient conditions. 

The iterations above are rapidly untractable unless both operators and © evolve in a 
parametric family of distributions, i.e. distributions specified by a fixed finite number of real 
parameters. In what follows, this number of parameters will possibly vary along iterations. 

More precisely, let us define a class T of distributions on X as follows. First, we start 
with a parametric class of the form 

F = {vi,0e 9,!GN} (17) 

where C MP is a parameter set. Each distribution in T is thus specified by a couple 
(i,0) 6 N x 6 and there is a one-to-one correspondence between N x and the class T . 



= ^("n|n:o)- ( U ) 

(12) 

( 13 ) 
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Then, using the set S of mixture parameters defined in l[T|). we build the enlarged class 
composed of convex combinations of distributions v\ having the same parameter 6: 

oo 

f = {u; v = on, v\ = v g>a , 6 £0,a = (a h % > 0) G S}. (18) 

i=0 

Now, each distribution vq a on T is specified by a parameter 9 and a mixture parameter a. 
We stress the fact that all components in a given mixture have the same parameter 9. The 
mixture parameter a of a VQ^ a may or may not depend on 9. For 

a = a {i) given by af = 1, a f=0,j^i, (19) 

we get the distribution v % Q : 

4 = v e, a w- ( 2 °) 

Obviously, T C T . But the resulting extended class may be considerably larger. Of course, 
the number of components in the mixture can be finite. So, we shall define the length of a 
mixture parameter a by 

1(a) = sup{i; a, > 0}. (21) 
We define the sub-class of distributions with finite-length mixture parameter by 

T f = {v = u e , a G F, Ka) < oo}. (22) 

On the other hand, when a has infinite length, the series defining an element v in (|18|) may 
have an explicit sum, which will be another expression of v. 

Now, we want conditions such that, for v € <fy(v) and ^(f) both belong to T. In such 
a case, it will be enough to express both operators in terms of the couple (9, a) specifying 
the distributions in T . Moreover, when the two operators evolve within Tf, then the exact 
and the prediction filters are exactly computable even if the number of mixture components 
varies along the iterations. 

Let us consider the following conditions. 

• (CI) For all y G y, for all v G J 7 , (p y (u) G T (see Q). More precisely, for all (i, 9) G N x 9, 

<Pv04) = u T y { (jy 

with T y {9) G O, t y : N — ► N a one-to-one mapping and (9, y) — > T y (9) measurable. 

• (C2) For all v G J 7 , = vP G J- . More precisely, for all (i, 9), may be written as 

i>o 

where G S, t(9) G O and 9 ->■ (r(6»), a('' e )) measurable. 

(C2-f) For all is £ J 7 , ^(v) = vP G J-f, with, using the notations of (C2), for all (i,9), 
l(a^) = L(i) < oo and the mapping i — > L(i) is non decreasing. 
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• (C3) For all x 6 X, P(x, dx') belongs to the class and may be written as 



P(x,dx') = J2<Xi(x>k(dx'), 



i>0 



where a°(x) G S, 9q E and x — ► aP(x) is measurable. 

Let us make some comments about these conditions. Condition (CI) concerns only ip y 
and the class T ' . The up-dating operator <p y becomes the following mapping from T to T: 



Note that, in condition (CI), the function t y (.) must not depend on 9. Thus, the class T 
is a conjugate class of distributions for the parametric family F x (dy) = f x (y)dfj,(y) (in the 
sense of Bayesian estimation). Conditions (C2)-(C2-f)-(C3) concern the transition operator 
P and the class T . Condition (C3) implies that, when the signal starts at a fixed xq = x, 
then, the distribution of x\ belongs to the enlarged class T . Therefore, we can consider that 
Dirac measures belong to the enlarged class or directly add all Dirac measures to this class. 
Conditions (C2-f) and (C3) may appear contradictory. Actually, this is not the case because 
a distribution v in T may have two different representations, i.e. the equality uq a = vg> a > 
does not imply (9, a) = (9', a'). Moreover, one representation may be finite and the other 
infinite. We discuss this point in Section 

We have the following result. 

Theorem 2.1. 1. Assume (Cl)-(CB). If v belongs to T (see then, y> y (y) and 

t/j(u) both belong to T. More 'precisely, if u = ug a then, 

tp y (v) = v T y{e),a y {e, a ) and ip{v) = v T (e),b(o,a) 

where T y (8),r(0) are defined in Conditions (C1)-(C2) and the mixture parameters 
a y (9,a) and b{9,a) are given in formulae \29j) and \31\) or iSS\) . 

2. Assume (Cl)-(C2-f). If v belongs to Tf (see 1122]) ). then, tp y (y~) and both belong 



Proof. Consider first the up-dating operator <p y (see Pj)-f7j)-(|8]l). Let v = ug >a = J2i>o a i v e 
Then (see ©), 



(i,9)^(t y (i),T y (9)). 



(23) 



toTf. 




(24) 



j>0 



with, for all i 




(25) 



We have 




(26) 



Using (CI), since the mapping t y is one-to-one, we get 



f x (y)is(dx) = Y, a i Pui(y)^T y %( dx ) = Y, a ty\j) P u t~Hf)(yy Ty {e)^ dx ) 



(27) 
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Hence, 

<Py(ve,a) = VT y (0),a y (e,a), (28) 

where the parameter T y {6) is defined in (CI) and the mixture coefficient a y (9,a) is given 
by (see (JUJ)): 

°v(M)i = ^ • (29) 

The operator ip y on T can be expressed in terms of the parameters by the mapping: 

(fl,a)e0xS-» (T y (9),a y (9,a)) £0xS. (30) 

By (|29|). the mixture parameter a y (9, a) has finite length when a has finite length. 
Consider now the prediction operator ip (see ©)• By linearity and (C2), we get 

«>o i>o 

where t(9) defined in (C2) and the new mixture parameter is obtained by interchanging 
sums and is given by 

b(9,a) j = ^2a i af' 6 \ (31) 

j>0 

The prediction operator tp is therefore now given defined by the mapping: 

(0,a)-(T(0),&(0,<*)). (32) 
Now, if 1(a) = p, and (C2-f) holds, then 

p 

b(9,a)j = ^2 a i a j \j<L(i)) = ^2 ( 33 ) 
i=0 i>£ _1 (j),i<P 

where L~ l (j) = inf{i;L(i) > j}. Since = for i > p, b(9,a)j = as soon as L _1 (j) > 
p. □ 

Remark. 

1. It is worth noting that our conditions imply that the the parameters T y (9),r(9) only 
depend on 9 whereas a y (9,a),b(9,a) depend on (9, a). 

2. For x € X, it is immediate to check that f y (5 x ) = 5 X . By (C3), ip(5 x ) = z^ ,»°(»- So 
the algorithm starting with a deterministic initial condition evolves in T. 
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3 The Kalman filter with non Gaussian initial condition. 



Our first example is based on the classical and simplest standard one-dimensional Kalman 
filter. It is well known (see e.g. Makowski (1986)) that, whatever the initial distribution for 
the Kalman filter, it is possible to compute the prediction and exact filters. We illustrate 
this property through a special family of initial distributions. Let us recall the model. The 
observation equation is given by 

y n = hx n + ~fw n , (34) 

with h, 7 constants (7 > 0), (u> n ) a standard one-dimensional Gaussian white noise. And 
for the signal 

x n = axn^x +(3rj n , (35) 

with a, (5 constants {(5 > 0), (r/ n ) a standard one-dimensional Gaussian white noise. The 
sequences (x n ) and (w n ) are assumed to be independent. Now, the conditional distribution 
of y n given x n = x is 

f x {y)dy=M{hx, 1 2 ). (36) 
And the transition operator of (x n ) is 

P(x, dx') = p(x, x')dx' = M(ax, (3 2 ). (37) 

We introduce below a class of non Gaussian distributions and show that our conditions 
(C1)-(C3) hold for this class. Therefore, can be explicitely computed. Before doing 

this, we recall the classical case. 



3.1 The standard Kalman filter. 



It is well-known that if the initial distribution is Gaussian (or deterministic) then, for all n, 
the distributions © and © are Gaussian. Let us denote by Q = {M(m, a 2 ), m £ R, a 2 > 0} 
the class of Gaussian distributions. The up-dating and prediction operators are from Q onto 
Q. And, some classical and elementary computations yield: 

• The up-dating step is: u = N(m, a 2 } — > (f y {v) = AA(m, a 2 ), with 

A _ m 7 2 + hya 2 „ 2 _ a 2 ^ 2 

• The marginal distribution is: Pu{y)dy = J\f{hm,^ 2 + h 2 a 2 ) 

• The prediction step is: v = J\f(yn, a 2 ) — > ip(v) = Af(fh, a 2 ) with 

fh = am, a 2 = (3 2 + a 2 a 2 . (39) 



The formulae above hold true when a = allowing to include the case of Dirac measures. 
Note that, for all x, P(x,dx') = N(ax,f3 2 ) also belongs to Q. 



8 



3.2 An extended Kalman filter. 

Now, we enlarge the class of Gaussian distributions using new distributions and mixtures. 
Consider three parameters /J,, m, a 2 with fi, m € M and a 2 > 0. For i = 0, set 

"(o,m^)(dx)=Ar(m,a 2 ). (40) 

and T = Q. For i > 1, set 



w-)^) = cil + /L 2 ) " ( ° ^ 2)(dx) ' (41) 

where the normalizing constant is given by 

Cxim + wo 2 ) =E((aX + fi + m) 2i ), (42) 
for X a standard Gaussian random variable. Let denote by 

C 2l = E(X 2 *) = ^, (43) 
the 2i-th moment of X. Then, for < k < i, some elementary computations yield: 



*V»-M). («) 



We deduce 



C 2i (m + M ; a 2 ) = J> + M ) 2 V 2 ^ fe )^i (*) . (45) 

k=0 2fc V / 

Set^ = {i/ ( ^ m(T2) ,(/i,m,(j 2 ) GMxMx (0,+oo)}. Define T = Ui>o & ■ 

In the Appendix, we study some elementary properties of these distributions. 

The class T is defined as in (|18|). All distributions in .F have density with respect to a 
Gaussian law. 

Now, we check conditions (Cl)-(C2-f)-(C3). Condition (C3) evidently holds (see (1371) ) 
since T contains all Gaussian distributions. By the following proposition, condition (C2-f) 
holds. 

Proposition 3.1. Consider the model given by ffify-pH fy -pIf fy -tfFfy. 

1. For i = 0, and v = J\F(m, a 2 ), <Py(v) = M(rh, a 2 ) with rh, a 2 given in 

2. For i > 1, and v = "{^^(dx) in F, <p y (v) = v^^y 

Condition (CI) holds with t y (i) = i, for all i > and T y (fi,m,a 2 ) = (pi,rh,a 2 ) (where for 
i = 0, ft = 0). 
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The proof is obtained in the same way as for the up-dating step for the classical Kalman 
filter (see the Appendix). Now, looking at formulae (JSJ) and (JTDJ), since f x (y) and p(x,x') 
in this model are both Gaussian kernels, the computation of marginal distributions and the 
checking of (C2-f) are identical up to a change of notations. The results are given in the 
following proposition. 

Proposition 3.2. Let v = v\ 2\(dx) belong to J 71 . 



1. Then 



M= . ( 47 ) 



(Xi, 



^) = E4^W)> ( 46 ) 

k=0 

with m, a 2 given in \H!J\). 

m(3 2 + fia 2 
a a 2 

and for k = 0, . . . , i, 

3=0 

with 

C^2 % k=o^k = Hence, Condition (C2-f) holds with T(fi,m,a 2 ) = (fl,m,a 2 ), for 

k = 0, . . . , i, a^' r 
v G T % is L(i) = i. 



k = 0,...,i, Q,^'^' m '°" ) = a W anc j: ^ e i en gth j fog mixture parameter of ip{v) for 



2. The marginal distribution p u (y)dy is given by the same formula as \4°\ ) with (a,(3 2 ) 
everywhere replaced by (h,^ 2 ). 



The proof is given in the Appendix. Note that, using 2^- = 1 — we have 

( P 2 

m + u = aim + a) 1 — To 
V a* 

Remark. It is worth noting that, in this model, the number of parameters remains fixed 
along iterations: If the initial condition is specified by parameters (/i, m, a, a) with 1(a) = p, 
i.e. 3+p + 1 parameters, then the length of the mixture parameter will always be equal to p 
and the number of parameters will remained fixed equal to 3 + p + 1 . This is not surprising 
since the Kalman filter is a finite-dimensional filter, even when the initial condition is non 
Gaussian (see e.g. Makowski (1986)). 




4 Scale perturbation of a radial Ornstein-Uhlenbeck process. 

In this section, we consider multiplicative perturbation models of the form 

y n = x n w n (50) 
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where (w n ) is a sequence of i.i.d. positive random variables and (x n ) is also a positive 
signal independent of the sequence (w n ). The multiplicative structure comes from the field 
of Finance with the so-called stochastic volatility models. However, in stochastic volatility 
models, the noises are standard Gaussian variables (see the next section). The advantage 
of positive signal and noise is that we can interpret the model as a scale perturbation of a 
positive signal. 

Now, we consider a signal which is a discretization of the radial Ornstein-Uhlenbeck 
process. And, for the noise, we consider positive random variables with a specific distribution 
and build a class T such that conditions (Cl)-(C2-f)-(C3) hold. 

4.1 The signal 

We assume that 

X n = InA (51) 

is a discretization of a continuous time diffusion (Xt) equal to a radial Ornstein-Uhlenbeck 
process. We recall its definition and properties. 

4.1.1 The one-dimensional radial Ornstein-Uhlenbeck process. 

Consider the one-dimensional Ornstein-Uhlenbeck process given by: 

& = & + f eZ s ds + aW t , (52) 
J o 

where (Wt) is a standard Brownian motion and the initial variable £o is independent of (Wj). 
Then, 

tt = ^e et + a f e e ^- s Uw s . (53) 
Jo 

Let Xt = |£t|. Then, a simple computation shows that the conditional distribution of Xt 
given £o = £ only depends on x = |£| so that (X t ) is a Markov process. We may call it 
the one-dimensional radial Ornstein-Uhlenbeck process. Let us now give the conditional 
density of x n = X n /\ given x n _i = X/ n _x\&, i.e. the transition density of (x n ) (see (|51)0. 
For A > 0, we set 

26»A _ i 

U n = ^nA and a = e eA , f3 2 = a 2 ^ . (54) 

Then, as can be easily deduced from (|53|) . (U n ) is a standard AR(l)-process satisfying 

U n = aU n -i + /3rj n , n > 1, U = £ (55) 

where (r] n ,n > 1) is a sequence of i.i.d. random variables having distribution jV(0, 1). Now, 
x n = \U n \ = X n /± is a Markov chain having transition density (for positive x) 

p {1 \x,x') = (p(x,x r ) + p(x,-x'))l {x , >0) , (56) 
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where p(u,u') is the transition density of (U n ), i.e. 

I I \ 1 (V ~ aU ) 2 /V-\ 

"^^w^^—^- (57) 

A simple computation shows that (with x > 0) 

2 X^ Qp 1 X^ 1 ( CLXX^ \ 

p w (x,x') = l (i ., >0) ^=exp(-^2) exp(-^-) ^cosh(-^-)J . (58) 

Now, using the series expansion of cosh, we obtain a representation of this transition density 
as the following mixture of distributions: 



2 x' 2 x' 2i 

pW(x,x') = l [xl>Q) —j=^ v {-—) ^ ai ( x )__ ; (59) 



Oi(x) = exp (-- — w ) 77. (60) 



where Cn = ^qr is the 2i-th moment of a standard Gaussian variable (see l|43|)). and for 

i > 0, 

a 2 x 2 / a 2 x 2 V 1 

For 0<O(O<a<l), the process (x n ) has a stationary distribution given by 

2 x 2 

7r (1) (dx) = l (a . >0 )— 7 = exp (-—=■) dx (61) 

with (see (JMJ)) 

P = W = 2|0j" (62) 
These formulae will be the useful tool for the construction of the class T below. 



4.1.2 The 5-dimensional radial Ornstein-Uhlenbeck process. 

For 5 > 1, consider the stochastic differential equation 

dX t = (9X t + a2 ^~ l) )dt + crdflt, X = V . (63) 

where (/3t) is a standard Brownian motion and rj is a random variable independent of (/3t). 
The values 9, a, 5 are constant parameters. This process is called a radial Ornstein-Uhlenbeck 
process. This is due to the fact that, when 5 is an integer greater than 1, then Xt is the 
Euclidian norm of a <5-dimensional vector (£*, . . . , £f) whose components are i.i.d. Ornstein- 
Uhlenbeck processes satisfying: 

d$ = eddt + adW^. 

Moreover, when (Xt) is given by (|63j) . the process Rt = X 2 is the classical Cox-Ingersoll-Ross 
diffusion model given by: 

dR t = (26R t + 5a 2 )dt + 2<TR l t /2 df3 t . (64) 
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The processes (Xt) and (Rt) have explicit transition probabilities with densities with respect 
to the Lebesgue measure on (0, +00). There are closed-form formulae for the transition 
densities when 5 is an odd integer. Otherwise, they depend on Bessel functions and have 
explicit developments as sums of series. As above, we set x n = X ni \ and give the expression 
of the transition operator of this Markov chain using the notations (|54|) . For a > 0, let us 
set 

C a = E\X\ a , (65) 
for X a standard Gaussian random variable. 

Proposition 4.1. 1. Assume 5 > 1. Then the transition density of the 5 -dimensional 
radial Ornstein-Uhlenbeck process (see lo'ty) ) is equal to (with x > 0): 



2 x' 2 x /S-i+2k 

p^(x,x') = l^ — e^C-—) ga fc (x)^_ i+2fc ^_ 1+2fc , (66) 



where the mixture coefficients are given by 16 (J\) and the couple (a,/? 2 ) is linked with 
the original parameters (0,a 2 ) through relations \5$ . 

2. Assume that 5 = 2n + 1 with n > 1 an integer. Define the operator T , acting on 
functions f & C 1 ((0, +00), M) ; by T(f)(x) = Then, the transition density of 

(x n = X n A), where (X t ) is the 2n + 1-dimensional radial Ornstein-Uhlenbeck process 
is equal to ( with x > 0) 

2 x' 2 a 2 x 2 x' 2n 

p( 2n+ >(x, x') = l( x i >0 )— y=exp(-^) ex P(-^2") [T n {cosh){z)] z= ^ 

"(67) 

where T n = T o T . . . o T is the n-th iterate of T . 

Moreover, the above formulae also hold for 5 = 1 (n = 0) (see 158\) - I5y\) ) with the 
convention that T°(f) = f . 

When 9 < (0 < a < 1), the process (Xt) and the Markov chain (x n ) have a stationary 
distribution equal to (see ()54 [) )-(|62 [) ) 

2 x 2 x^~^ 

TT^\dx) = l (x>0) — =exp(-— ^) dx. (68) 

PV27T *<P P W-l 

Details are given in the Appendix. 



4.2 Distribution of the noise 

Assume that, for all n, w n has the distribution of T~ 1 / 2 where T has an exponential distri- 
bution with parameter A > 0. Then, for all positive x, the distribution of Y = xw\ is given 
by: 

1\x 2 Xx 2 

F x (dy) = f x (y) dy, with f x (y) = ^-exp( — — ) l (0 ,oo)(y)- (69) 
It is worth noting that the distribution of wi (Fi(dy)) satisfies: 

E(| logt^il) < 00 and E(w[) < 00 if and only if r < 2. (70) 
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Instead of an exponential distribution, we could take a Gamma distribution with integer 
index. 



4.3 The class of distributions. 



First, for i > 0, we set, for 6 > 1, 

9f\x) = Wo) jz " 1/9 t; exp(-^-), (71) 



2 x 5-l+2i x 2 

L (*>o) (27r)1/2 C5 _ i+2j 



where C a is defined in l|65|). Thus, gf^ is a probability density on (0, +oo). Then, for a > 0, 
we set 

^>(ds) = !,{<>(> = 1 (I>0) ^^-^-^e^f-l,) *. (72) 



For each i, the distribution u % J^ is equal to the distribution of y where is Gamma 
with parameters (i + 5/2, l/2cr 2 ). This Gamma distribution is identical to a a 2 x 2 {5 + 2i) 
(with non integer parameter). Hence, as i increases, the distributions {1% , i > 0) are 
stochastically increasing. Let us now define 

r ,(8) = { u US). a > = Ufe()<F i,W. (73) 

And, 

={v = u a>a = J>< « = G S > ^ i6) e ^' W > ' > 0}- (74) 

The law of \X\ for X a Gaussian variable with mean m and variance a 2 has the density 

x 2 v , m 2 , / ,mx. 



9{x) = \ x>Q )—j= exp(-^)exp(-^j) (^cosh(— Tj -)J . (75) 

A Taylor series development of the cosh yields a distribution of the class T^ 1 ' with mixture 
parameter 

77l 2 1 / 7Y1 2 \ 1 

Qi(m,<r) = ex P(- 2^2)^ ( ^2 J ' * - °- ( 76 ) 

All distributions in have a density with respect to a iv'^(cfcc) for some positive a 
(see (|72|l). And this density is expressed as an entire series of even powers. The transition 
density (|58|) and the stationary density (|61|) belong to . The transition density <|tj(jj) and 
the stationary density (|68|) belong to .T^ 5 ). Thus, our condition (C3) holds for the model 
defined by d3TJ) and (jS5|) . 



4.4 Up-dating, marginal and prediction operators. 

In this section, we show that the filtering and prediction algorithms evolve in the class 
when the signal is a discrete regular sampling of the <5-dimensional radial Ornstein-Uhlenbeck 
process. For this, it is enough to check conditions (C1)-(C2)-(C3). 
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We have already noted that condition (C3) holds. Actually, the form of the transition 
density Q66JI and of the stationary distribution (|68j) (when it exists) indicates how to define 
the class . 

Proposition 4.2. Let y > 0, and consider the up-dating operator ip y (see ^)) corresponding 
to f x (y) given in \6!A) . For i > and a > 0, with Ua defined in \7'^j , 



<p(u^) = u£^\ wtth T y (a) 



a y 



T B (a) > — *VVJ~ ( y 2 + 2A(7 2)l/2- 
Thus, t y (i) = t(i) = i+l and (CI) holds for (cp y ,^ s )). 

Proof. We have 

f x (y)gf\x/o) oc a*-i+mi) exp-((^ + 

y A a A 2 

Hence, we may define 

1 _ 2A 1 

Moreover, we have t y (i) = t(i) = i + l: if v € .P*' (<5) , then, ^(^) G _P +1 >( 5 ). So, we get the 
result. □ 



We also give the marginal distribution. 

Proposition 4.3. For i > and cr > 0, mt/i z4- defined in the marginal density 

(see is equal to 

p„*.w(v) = Vo^pf } (^) (77) 

with 

P * {V) ( y 2 + 2 )i+l+5/2 • 

Proo/. Using C 5 _i +2 (i+i) = (S + 2i)C S -i+2i, we get 

1 ( & )( x \ f ( \a ^o- 2 {S + 2i)y s ~ 1+2i 1/2 _j (5), y s 



□ 



Remark. Proposition 14.31 allows to obtain the density of (yo, J/i, • • • , 2m), i-e. the exact 
likelihood based on this observation. Indeed, this joint density is obtained as the product of 
the conditional densities of y% given yi-\, . . . , y%. These are computed as marginal densities 
(see (QSJ)). 



15 



Proposition 4.4. Consider the transition operator P^ with transition density \6b)) . Let 
v = 4' (<5) be given by (7§ with i > 0, i.e. 4' (<5) G P' {5) ■ Set vP^ = ip^(u). Then, 

^^)=±at ] ^ (79) 

fc=0 

with 

r 2 (a) = P 2 + a 2 a 2 , (80) 

and for k = 0, 1, . . . ,i, 

T/ms, L(i) = i and ^ 5 \v^ {5) ) belongs to . Condition (C2-f) holds for (P^,^). 
Proof. We have to compute 

/■OO 

A= / (l/<T)g^(x/a)p^(x,a/)dx, (82) 
with given in H71[) and p^)(x,x') given in (|66|) . Let us define s 2 by 

1 - ° 2 1 



Hence, for all k > 0, 

-,2^2 



Now, using s/(3a = l/r(a), A is given as the following expression 



2{i+k)+5 



A 



(84) 



eXp( "^^ E ' (85) 



r( C r)(27r) 1 /2 2/?2 ^ r 5-i( CT ) 

with 

Using (|5H|) and some computations, we get, for all k > 0, 

a 2\fc s 2(i+fc) / 02 y / /? 2 \ fc 

= l^vyj I 1 " ^viJ • (87) 

Now, we set 

c 2 /? 2 V r 2 (a) 
This yields 

^1 ( X > 2 \ k ( p 2 y ^j^-i 

Now, we use the following lemma whose proof is given in the Appendix. 



n=m U-db-l- (88) 
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Lemma 4.1. For all integer k > 0, and all i > n 

^T" 1 =£k(k-l)...(k- j + l)a j 

= a + km + k(k - l)a 2 + . . . + k(k - 1) . . . (k - i + l)^. (90) 

with, for j = 0, 1, . . . , i, 

23 



3 J C 2 j+s-i ' 



and the coefficient of oq is equal to 1. 

Now, we transform expression (|89|) into 



E = £ S * (91) 

i=o 



where 



(q2 \ ' ™ -i / 12 \ K 



fc=0 

But, fe(fe - 1) . . . (k- j + 1) = for k = 0, 1, . . . , j - 1. So, we get 
02 y / x /2x* ! 



1 / p* \ fi\ fx' 2 \ 3 . x' 2 



-*\ exp(— A (93) 



and 



Now, we compute A from (|85[l. Using (|88|). we add the exponents of the exponential terms 
and after some elementary computations, we obtain 

with 

3=0 

^' = G) C2~^[ G^V)) (^) ) I 1 " ^)) • (%) 
So the proof is complete. □ 

Remarks. 

1. Here, we have two representations of tp^ (u^ )■ One has scale parameter j3 and a 
mixture parameter with infinite length. The second has scale parameter t(ct) and a 
finite length mixture parameter. The latter appears as a minimal representation of 
this distribution in a sense that we try to clarify (work in progress). 
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2. By Propositions 14.21 and 14.41 we see that both the exact filter and the prediction filter 
evolve in the extended class !p( s \ If the initial distribution of the signal is in the 
subclass J-'i {e.g. if the signal is in stationary regime), it has a mixture coefficient 
with finite length (see (|74|)). Then, the number of mixture components grows of a 
unit at each iteration but remains finite. However, the numerical simulations that we 
have done in Genon-Catalot and Kessler (2004) show that there are only two or three 
significantly non nul mixture coefficients. Some stability results are also obtained that 
may be extended to the model investigated here. 

5 Stochastic volatility type models. 

We first draw some immediate consequences of the previous section. Then, we introduce 
some new type of stochastic volatility models. 

5.1 Scale perturbation of a Cox-Ingersoll-Ross diffusion process. 

Consider now the model obtained by taking squares of the previous one. Set 

Zn = V n = r n v n , (97) 

with r n = x\, v n = u> 2 and (x n , w n ) as in the previous section. Then r n = R n/ \ is a discrete 
sampling of the Cox-Ingersoll-Ross diffusion model (|64j) . It is a Markov chain with transition 
(see p3>) 

q ( s )(r,r') = (1/2)^-3 p(<V/ 2 y 1/2 ) 

= V>o)^ex P (-^ ) g« fe (^)^^. (98) 

(see ()60|) for (otk(x)). This is now a mixture of Gamma distributions with parameters 
(k + 2, 2^-)- 

The distribution of the noise (v n ) is now inverse exponential. And the class of distribu- 
tions is composed with mixtures of Gamma distributions with parameters (i + |, ^j), for 
i > 0. The filtering and prediction algorithm can be explicitely expressed with the same 
formulae after some simple changes z = y 2 for the observations and the change of variables 
x = r2 for the distributions. 

Note that another computable filter is obtained by setting 

z n = — = —v n , v n = — . (99) 

Zn 1 n Un 

5.2 Stochastic volatility type models. 

The above considerations lead to some new type of stochastic volatility models. Indeed, 
stochastic volatility models usually postulate that the observed price process (S n ) of an 
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asset is such that 

Z n = log^±i = s/V n e n , (100) 

where (V n ) is a positive Markov chain (the unobserved volatility), (£ n ) is a sequence of 
i.i.d. standard Gaussian variables, the two sequences being independent. We do not know 
explicit filters for such stochastic volatility models when the signal is a discrete sampling of 
a diffusion process. 

Now, taking squares in ()100|) . we get that Z\ = V n e^, where e\ is distributed as a 
X 2 (l) = (7(1/2, 1.2). Our previous study suggests to replace the (7(1/2,1/2) distribution 
by a G(1,X) (possibly a G(k,X) with k integer). More precisely, the following stochastic 
volatility type models will provide explicit filters through a symetrization device. Consider 

z 'n = Vr^e'm or z n = -J= e 'L ( 101 ) 

with r n = R ni \ a discrete sampling of a Cox-Ingersoll-Ross diffusion. For the noises, consider 
a symetric Bernoulli variable e±l with probability 1/2, independent of a random variable T 
having distribution (7(1, A) (exponential distribution). Then, assume that e' n is distributed 
as ^= and that e" is distributed as ey/T. Then, the two models of filtering given in (jlOlj) 
can be solved explicitely. 



6 A discretized Cox-Ingersoll-Ross diffusion and condition- 
ally Poisson observations. 

Models which have no representation as y n — H{x n ^ w n ), for some simple function H, are 
also of interest. These models are completely specified by the conditional distribution (fljl 
and the transition operator of the hidden Markov chain. We investigate below such an 
example. Suppose that the couple (signal, observation) is defined as follows. The signal 
is the process (r n ) obtained as above from a discrete sampling of the square-root model 
(|64*|) . Now, the observation y n is such that, given r n = r, y n has a Poisson distribution with 
parameter A r, i.e. 

= yK = r) = f r (y) = exp (-Ar)^^,y G N. (102) 

y' 

We can check our conditions with the class of distributions fitted with the signal (r n ), i.e. 
the class of mixtures of Gamma distributions G(i + |, ^) with parameters (i + |, ^r), for 
i > 0. Only (CI) needs to be checked. Let us set for this Gamma density 

•* m(r) = '"i 1 "^ 1 ^ (103) 

Now, 

f r (y) 7 < (r) oc exp [-(A + ^)r] r^^+D. (104) 
This is again a Gamma distribution of the same type. And we get 

t y (i)=y + i, T y {a) = {2\ + \)-^. (105) 
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The filtering and prediction algorithm will evolve in the family of mixtures of Gamma 
distributions with tail index i + (5/2), i > 0. The marginal distributions can be explicitely 
computed. 



7 Concluding remarks. 

This work has to be completed by numerical simulations. In Genon-Catalot and Kessler 
(2004), the model corresponding to a one-dimensional radial Ornstein-Uhlenbeck process is 
studied and implemented. The numerical results show that the number of significantly non 
nul mixture coefficients is less than 2 or 3. Theoretical properties linked with the stability 
of the filters are established in this paper which may be extended to the models of Section 

m 

The above results may be extended to the case of a non time-homogeneous signal. For 
instance, it is possible to consider a non regular discrete sampling of the underlying diffusion 
model (i.e. to consider x n = X tn with < t± < . . . < t n < . . .). It is also possible to consider 
non time-homogeneous conditional distributions of the observation given the signal. Note 
also that the signal may or may not be ergo die. 

Acknowledgments. The authors wish to thank Wolfgang Runggalddier and Pavel Chi- 
gansky for helpful discussions and references. 
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8 Appendix 

8.1 The extended Kalman filter 

The class T % . We compute the Laplace transform of a distribution v \ ilimcj %\ (see (|41)0. 
For X a random variable having the previous distribution, elementary computations using 

y ield > for A e C > 

E(ex.p\X)= — — exp Am + — -^). (106) 

C 2i {m + /jl; a 2 ) KV 2a 2 

All parameters (i,m, fJ>,o~ 2 ) are identifiable. ^From this formula, we can prove that, for 
all m,fi and all i, as a tends to 0, ^L m(T 2) weakly converges to the Dirac measure 5 m . 
Moreover, for any mixture coefficient a, X^>o a * v \^ma 2 ) we& kly converges also to 8 m . 

Proof of Proposition^^ Let us consider a random variable X with distribution v = v\ m a2 \ 
and let Y = hX + e with X and e independent, and e having distribution AA(0,7 2 ). Then, 
tp y (v) is exactly the conditional distribution of X given Y = y. Its density is proportional 
to: 

x - (x + exp [-(^^ + ^#)] (107) 
We compute the exponent of the exponential above and obtain: 

{y-hx) 2 (x-m) 2 (x - rh{y)) 2 

27 2 + -^2— ~ ^2 » ( 108 ) 

with: 

~ I \ m l 2 + h y<? 2 -2 2 , , 2 2 nnn\ 

m{y) = 35 , o- = 7 + h z a*, (109) 

o~ 
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and 

*» = ^. (HO) 

This implies that tp y (y) = ^, ^ -. 2 y So, we get the proposition. Note that this result 
contains the standard case where i = and = 0. 

Proof of Proposition VJ.'A As noted in the text above, in this model, the transition kernel 
p(x,x') and the conditional kernel f x (y) are of the same form. Therefore, the computations 
of vp{v) and of the marginal density p v {y) of Y are identical up to a change of notations 
((a, pi 2 ) for ipiis), and (h, r y 2 ) for p u {y))- Because of the previous proof, it is more convenient 
here to compute the marginal density of Y when Y = hX + e and (X, e) are as in the 
previous proof. We shall use the same notations as in the statement of Proposition 13.21 but 
the formulae will be given with (h, 7 2 ). We have to integrate f x {y) v(dx) (with v = v l , m a2 \) 
with respect to x. After some elementary computations, we obtain: 

(y — T?l) 2 

p v {y) = A 2i exp [ ^ — 1> rh = hm, (111) 



where (see (jTMll - ltTTUl) ) 

Let us set (see 



C 2i (n + m(y);o- 2 ) 
A 2i = n i — : 2\~- (H2) 



Thus, 



- m-f 2 + fia 2 

ha 2 ■ (U4) 

ha 2 p, + fhfj, + ma 

H + m{y) = -To- (y + M), — = — = 7—- (115) 



After some computations, we obtain 

1 ^ ^'^ /" hrr 2 y h ( ;/ -h //) 

k=0 



I2i 



-B,; f-( Vfc7 V a 2 J Co k a 2k 



Now, we set 



and 



0-k • 



Finally, for A; = 0, . . . , i, we obtain the following mixture coefficients: 

'i\ 7 2 ^- fc ) ^ fk\ (/x + m) 2 ^ ^ 2 ( fe -i)CT 2 ( fe -i) 
i=o 



And 



So the proof is complete. 



k=0 



(116) 



<*k = Jf-^i a k,i- (H8) 



t ;irgQ ? V a • <»»> 



pAy)dy = J2*k ) "(^)(dy)- (120) 
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8.2 The radial Ornstein-Uhlenbeck process. 



8.2.1 Gaussian moments, Gamma function. 

Let us set, for a > 0, and X a standard Gaussian variable, 

C Q =E(|Xn (121) 
And recall the definition of the usual Gamma function 

/•+oo 

r(o)=/ x a - x e~ x dx, a > 0. (122) 
The following relations are obtained by elementary computations. 

p/ q+l \ /2~7r 

C tt+1 = QC a _i,«>l, C a = J.V^ a^O, r(a) = ^— C 2a _i,a> 1/2. (123) 

^/2^2— 2 

Thus, when a = 2i, i £ N, i.e. a is an even integer, we obtain, Cq = 1 and for i > 1, 

C 2l = (2i - 1)0-2^!) = (2i - l)(2i - 3) . . . 5.3.1 = (124) 



8.2.2 Transition densities. 

For an integer 5 > 1, consider processes , . . . , £*) satisfying for all j: 
where (W/) are independent Wiener processes. Let us set Rt = X^=i£i) 2 > = Rt 



^ v. - F? 1 / 2 

By the Ito formula, we obtain dRt = Sf=i 2 £i? + 5a 2 dt. By Levy's characterization, the 
process defined by 



is a standard Brownian motion. And, 



o 



X t 



dR t = (28R t + 5a 2 )dt + 2aR] ,2 d(3 t . (125) 

Therefore, the process (i?t) is the classical Cox-Ingersoll-Ross diffusion process. Another 
application of the Ito formula gives the stochastic differential of (Xt): 

dX t = (9X t + ^ )<& + <rd#. (126) 

2 At 

Now, we do not assume any more that 5 is an integer. We assume in the stochastic differential 
equations (|125|) and (|126|) that 5 is a real parameter satisfying 5 > 1 and define the index 
i> = (5/2) — 1. When = and a = 1, (X t ) is the standard Bessel process with index za The 
scale and speed densities of (Xt) are obtained by the classical formulae for one-dimensional 
diffusion processes. The scale density is given by 

9 r x (ft — i fl r 2 

s(x) = exp(-4/ (0u + - ^)dw oc x"^- 1 ) exp(-^V). (127) 

a A J 2u o 2. 
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The speed density is m(x) = s~ 1 (x). The diffusion process ()126[) is positive recurrent on 
(0, +00) for 9 < 0. In this case, its stationary density is obtained by normalizing m into a 
probability density. Setting 



a 

P 



(2|fl|)V2' 
we obtain the stationary density 

7T^(x)= l x>0 ^=exp(-^)(-) (128) 
pV2vr 2p l \pj Cs-i 

This is the distribution of T 1 / 2 with T having Gamma distribution G(5/2, l/2p 2 ). 

The processes (X t ) and (Rt) have explicit transition probabilities with densities with 
respect to the Lebesgue measure on (0, +00). For these, we refer e.g. to Karlin and Taylor 
(p. 333-334). For the properties of Bessel functions that we use, we refer e.g. to Nikiforov 
and Ouvarov (1983). The conditional density of given Xq = x is as follows: 



12 

p(A, x, x') = P W ( X) x>) = 2 x l (x , >0) (x') 6 - 1 exp (^-) x 



O 20A \ a 

«P ( -^(pATT))^+^))( ff2(e ,A _ 1) )(^^ A )^X 

I u (xx'e 6t W ^ 



a 2( e 28t_ 1 y 

where I u (z) is the Bessel function with index v. This function is given by the following 
series development 

^)=(i)^E(D 2t fc!r(t ^ +1) . p») 

where V is the usual Gamma function. Now, we use the notations (|54jl and the relations 
(|123l) to transform p^ (x, x') and obtain: 

2 x' 2 x '5-l+2k 

p^M = h ^ — ^ { -—) T,a k (x) Cs _ i+2k(]S _ 1+2k , (130) 
where the mixture coefficients are given by (see (j6Uj) ) 

OA; (£J = exp I 



2/3 2 ' k\ V 2/3 : 



Now, when 5 = 2n + 1, the index is ^ = n — ^ a half integer. Then, the Bessel function 
is explicit and equal to: 

/ 2 \ 1/2 

J n _ 1 (z) = — z n T"(cosh(z)) (131) 
2 J 

where T n = T o . . .Tis the n-th iterate of the operator T(f){x) = f'(x)/x. And we obtain 
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8.2.3 Technical lemma. 

Proof of Lemma \4-l\ 

Let us set 

C 1 
= 2(i+fe)+g-i = (tf— l + 2fc + 2i — l)(<y— l + 2fc + 2i — 3). . . («5-l + 2fc + l). 

^2fc+5-1^2i+<5-l (^2j+<5-l 

(132) 

Hence, y?(x) is a polynomial of degree i which admits a unique representation as a sum of 
the elementary polynomials 1, x, x(x — 1), ... , x(x — 1) . . . {x — i + 1), say 

</?(x) = a' + a' x x + a! 2 x{x — 1) + . . . + a i x(x — 1) . . . (x — i + 1). (133) 

Let us set 

if)(x) = ao + aix + 02x(x — 1) + . . . + ajx(x — 1) . . . {x — i + 1) (134) 

with the coefficients a,j given in the statement of Lemma 14.11 We will prove that ip and ip 
are identical. For this, it is enough to check that 

¥>0') = i>(j) =ao+ jai + j{j - l)a 2 + ■■■+ jlaj for all j = 0, 1, . . . , %. (135) 
Computing the constant and the higher degree terms, it is easy to see that 

1 , / 2 * 

(p(0) = ao = — and (Zj = Oj 



2 1 

W-l W-l+2i 

Now, let us fix < j < i. Then, ^p(j) and have the following expressions: 

Cs-l+2(i+j) 



-1+2 j -l+2i 

= -r^ (5 -l + 2j + 2i- 1)(5 - 1 + 2j + 2i - 3) . . . (S - 1 + 2% + 1) 

W-l+2j 



PfJM (136) 



1 2i , ,2 2 i(i - 1) 
4>ti) = TT- + Jt; + J'O " 1)^ " 

W-l W-l+2 * W-l+4 

+ J (;-W-2) 2 ^-^ 2 ' + ... 

*y?l)...(«-j + l) _ qg 

J ! W- 1+2.? 

Hence, both quantities are polynomials of degree j as functions of the variable i. 
We now prove that, for all j, all 5 and all y, 

P^ 1 (x) = Q i £ 1 (y), (138) 
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with 



Pl My) = Qf\(y) = -^-, (139) 

W-l 



and 



,0') ' * 1 



P£iM = n & + S-l + 2j-l)(2y + 6-l + 2j-3)...(2y + 5-l + l) (140) 



^ W W-1+2A: 

Let us first look at Pg_i(y). Using Cs-i+2j = (S — l + 2j — l)C$_i + 2(j-i), we get 

(v) = Pti 1] (y) + 2y J P^ 1) (v - 1) (ua) 

Now, we look at Q^l^y)- Using the relation 

(£RV) + (i:i)' <-> 

we obtain 

Qf-M = Qti\v) + iyQti\y ~ !)• ( 144 ) 

Therefore, both families of polynomials satisfy the same relation (|142j) . Since (jLfflj) holds, 
we get (|138|> . So, the proof of the Lemma is now complete. 
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